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Abstract 

Appropriate regulation of gene expression is essential to ensure that protein synthesis occurs 
in a selective manner. The control of transcription is the most dominant type of regulation 
mediated by a complex of molecules such as transcription factors. In general, regulatory 
molecules are of two types: activator and repressor. Activators promote the initiation of 
transcription whereas repressors inhibit transcription. In many cases, they regulate the gene 
transcription on binding the promoter mutually exclusively and the observed gene expression 
response is either graded or binary. In experiments, the gene expression response is quantified 
by the amount of proteins produced on varying the concentration of an external inducer 
molecules in the cell. In this paper, we study a gene regulatory network where activators 
and repressors both bind the same promoter mutually exclusively. The network is modeled 
by assuming that the gene can be in three possible states: repressed, unregulated and active. 
An exact analytical expression for the steady-state probability distribution of protein levels 
is then derived. The exact result helps to explain the experimental observations that in the 
presence of activator molecules the response is graded at all inducer levels whereas in the 
presence of both activator and repressor molecules, the response is graded at low and high 
inducer levels and binary at an intermediate inducer level. 



PACS number(s): 87.10.Mn 

I. INTRODUCTION 

Gene expression, a fundamental cellular process whereby mRNAs and proteins are synthesized, 
is inherently stochastic in nature. There is a large number of theoretical and experimental studies 
which confirm the stochastic nature of gene expression [lj. The stochasticity or noise in gene 
expression is due to the small number of molecules involved in the associated cellular processes. 
For example, the DNA molecule which gives an organism its unique genetic identity is present in 
one or two copies per cell. The small number of molecules taking part in the biochemical events of 
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gene expression is responsible for the probabilistic occurrence of the events. The stochastic nature 
of the biochemical events introduces fluctuations around the mean mRNA and protein levels. The 
fluctuations constitute noise and cause identical copies of a gene to express at different levels. The 
total noise in the gene expression level has two components: intrinsic and extrinsic. The origin of 
intrinsic noise lies in the probabilistic nature of the biochemical events of gene expression. The 
sources of extrinsic noise is in the fluctuations in cellular components such as RNAPs, ribosomes 
and regulatory molecules. The noise in gene expression may give rise to heterogeneity in a cell 
population. Cell-to-cell variability is generally attributed to genetic differences though the envi- 
ronment and history are also contributing factors. Recent experiments [2j [3j provide evidence that 
stochasticity in gene expression can contribute substantially to population heterogeneity and con- 
sequent variability in the cellular phenotype. A population of cells with identical genetic sequences 
as well as history and subjected to the same constant environment can develop heterogeneities due 
to the random nature of gene expression. Cellular heterogeneity has been observed in a variety of 
cell types ranging from bacteria [5J to complex mammalian cells [6j. Several experiments combined 
with theoretical studies provide important new insight on the stochastic aspects of gene expression 

|2l El Eld El E]. 

Gene expression and its regulation are of fundamental importance in living organisms. There 
are many steps in gene expression pathway from DNA to proteins and different types of regula- 
tory molecules are involved in different steps. In general, transcriptional regulation is one of the 
most dominant types of regulation. The activator and repressor molecules are actively involved in 
the regulation of gene transcription both in prokaryotes and eukaryotes. Transcriptional repressors 
such as lac and tryptophan repressors are well known for prokaryotic systems. Repressor molecules 
inhibit the gene transcription by binding to the appropriate region of the DNA. Eukaryotic systems 
are much more complex and have compact chromatin structures. For the initiation of transcrip- 
tion, remodelling of the chromatin structure is essential so that the transcription factors and the 
RNA polymerase have access to the appropriate binding regions. Thus, gene activation in eukary- 
otic system means the relief of repression by the nucleosomal structure of the chromatin. After 
remodelling of chromatin structure, activator protein binds the DNA and activate gene expression. 
Activator protein concentrations can be varied by varying the inducer molecules such as galactose 

Experiments reveal that in an individual cell the gene expression response, the amount of 
proteins synthesized, can be of two types: graded and binary. In graded response protein level 
varies continuously with varying concentration of external inducer molecules. In binary response, 
protein levels can have two possible values: low or high. This is also known as the all-or-none 
phenomenon in gene expression. The binary response at the single cell level gives rise to a bimodal 
distribution in protein levels at the population level. There are experimental evidences of binary 
responses in gene expression with different possible origins [31 [TUl dU H21 E2 El]- Becskei et al. 
[10J have demonstrated that positive feedback with cooperativity can generate binary response in 
a synthetic eukaryotic gene circuit but without the positive feedback the response is graded. The 
presence of positive feedback loop with cooperativity gives rise to bistability and the bistability 
along with stochasticity produces binary response in protein levels. Recently, Tan et al. [11] have 
established that bistability may also arises from the interplay between a non-cooperative positive 
feedback loop and circuit-induced growth retardation. Blake et al. [3] and Karmakar and Bose [4J 
have shown that fluctuations in the levels of transcription factor can give rise to binary responses 
in the target gene expression in an eukaryotic system. Rossi et al. [12] and Biggar and Carbtree 
[13] have further shown that, in certain instances, competition between activator and repressor 
molecules to occupy the promoter region can generate a binary response in gene expression. If the 
activator or repressor molecules act independently, a graded response is obtained. The difference 
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in the cellular fates in binary response may be ascribed to heterogeneity in the distribution of the 
stimulus/inducer molecules in the cell population, different histories, i.e., initial states in the case 
of bistability, intracellular noise giving rise to fluctuations in key parameter values etc. 

To explain the experimentally observed binary responses in gene expression, different modeling 
approaches have already been proposed and analyzed using simulation and analytical techniques. 
Binary response in an autocatalytic induction circuit is very common and easily understood from 
different theoretical studies [T5l HE] . Noise can have important role in the generation of binary 
responses in gene expression. It may be of purely stochastic origin |TT] [T8| [T9] . Kepler and Elston 
1 17] have demonstrated through specific examples that only stochasticity in gene expression can 
give rise to binary response, i.e., a bimodal distribution in the protein levels. Pirone and Elston 
[18J show that the slow promoter transition in gene states is responsible for binary responses 
whereas fast transitions produce graded responses. Karmakar and Bose [19] defined the slow and 
fast transitions between the active and inactive states of the gene more precisely and established 
the conditions of origin of graded and binary responses in gene expression. They derived the 
distribution of protein levels assuming the random transitions between the gene states with protein 
synthesis and degradation occurring deterministically. Later, exact analytical distributions for 
mRNAs and proteins have been derived considering all the major steps of gene expression i.e., 
transcription, translation and degradation, to be stochastic [20, 21j. In this paper, we propose a 
simple model of stochastic gene transcription regulated by activators and repressors and show using 
exact analytical calculations that bimodal distribution in protein levels appears naturally when 
activators and repressors compete for the binding site mutually exclusively to regulate the gene 
transcription. On the other hand, a graded response is observed when only activator molecules 
regulate the gene transcription. 

II. STOCHASTIC MODEL AND EXACT SOLUTION 

Transcriptional regulation by activator and repressor molecules on binding the same promoter is 
an important regulatory mechanism of gene expression in living organisms. The activator (repres- 
sor) molecules activate (inhibit) the transcription by binding the appropriate site on the promoter. 
Here we consider a gene regulatory network where activators and repressors both regulate the gene 
transcription mutually exclusively [12]. This can happen in different ways and one such way may 
be the overlapping binding sites on the promoter (Fig. 1). Therefore, the activator and repressor 
molecules cannot bind the promoter simultaneously, rather they compete for their binding sites to 
regulate gene transcription. This mechanism of transcriptional regulation is represented by a sim- 
ple reaction scheme (Fig. 2) where a gene can be in three possible states: G±, G2 and G3. G2 is the 
unregulated state and G\ (G3) is the repressed (activated) state of the gene. The unregulated state 
of the gene which is achieved when both the sites are empty. Activator (repressor) molecules, on 
binding its specific site, help in transition from the unregulated state G2 to the active (repressed) 
state G3 (Gi) of the gene. There are random transitions taking place between the three states 
of the gene. Activator and repressor molecules compete for the state G2 to take control of the 
network. If activator molecule wins, the gene turns into active state and protein synthesis occurs 
with rate constant J p . Protein production does not take place from the unregulated (G 2 ) and 
repressed (Gi) states of the gene. Degradation of proteins occur with rate constant k p and this 
event is independent of the states of the gene. Here transcription and translation are combined 
together into a single step as done in earlier studies [T7] IT9"] . The stochastic transition from G2 
to G 3 occurs with rate constant k a and that from G2 to G\ with fc 2 (Fig. 2). The rate constants 
k a and k 2 are the functions of activator and repressor molecules respectively. Thus, in absence 
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FIG. 1. Schematic diagram of transcriptional regulation by activator and repressor molecules 
where both the molecules compete for their respective binding site. 
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FIG. 2. Reaction scheme with the three states of the gene: repressed (Gi), unregulated (G 2 ) and 
activated (G 3 ). From the activated state G 3 proteins are synthesized with rate constant J p . 



of repressor (activator) molecules the transition from G 2 to G\ (G 3 ) is not possible at all. The 
assumption that there can be three possible states of the gene provides the basis for the minimal 
model of the activator-repressor system. 

Let Pi(n,t) (i = 1,2,3) be the probability that at time t, the gene is in the Gi state with n 
number of protein molecules in the system. The Master equations for the biochemical reactions 
corresponding to the Fig. 2 are given by 



dp!{n,t) 
dt 



k 2 p 2 {n,t) - kip 1 (n,t) + k p [(n + l)pi(n + l,t) - npi{n,t)\ 



(1) 



dp 2 {n,t) 
dt 



hpi(n,t) + k d p 3 (n,t) - k 2 p 2 (n,t) - k a p 2 (n,t) + J [p 2 (n - l,t) - p 2 (n,t)] 



+ k p [(n + l)p 2 (n + l,t)-np 2 (n,t)} (2) 

— 7^— = k a p 2 (n,t)-k d p 3 (n,t) + J p \p 3 (n-l,t)-p 3 (n,t)] + k p [(n+l)p 3 (n+l,t)-np 3 (n,t)] (3) 

Now the standard approach of the theory of stochastic processes will be used to determine 
the steady-state probability density function for protein levels [22] • The generating functions are 
defined as 

t) = £Vpi(M), F 2 (z, t) = J2z n P2(n,t), F 3 (z, t) = £ n z n p 3 (M) and F(z,t) = £V p(n,t) 

n n n 

(4) 

where 

F(z, t) = Fx(z, t) + F 2 (z, t) + F 3 {z, t) 
p(n,t) =px(n,t) +p 2 (n,t) +p 3 (n,t) 

where F(z, t) and p(n, t) are the total generating function and total probability density function 
respectively. 
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FIG. 3. Plot of p(n) versus n for the activator-repressor system for b\ = 16 and four different sets 
of parameter values: long dashed curve: s x — 1, s 2 — 6, s a = 10, s d = 1, for solid curve: s x = 1, 
s 2 = 4, s a = 13, s d = 2, for short dashed curve: Si = 2, s 2 = 6, s a = 5, = 1 and for dotted 
curve: s x = 1.25, s 2 = 6, s a = 10, = 1.25. 



In terms of the generating functions (jl]), Eqs. ([!]), ([2]) and ([3]) can be written as 

dFi(z,t) , _ , s , ^/ >, , .dFi(z.t) 
^ 1 = k 2 F 2 {z, t) - k 1 F 1 (z, t) + k p (l- z) ^ ' ) 
at ''" 



dz 



dF f t ' t] = k l F 1 (z,t) + k d F 3 (z,t) - k 2 F 2 (z,t) - k a F 2 (z,t) + k p (l - z) ^ 2 ^ 

BF-iiz t) 8F?(z t) 

) = k a F 2 (z,t) - k d F 3 (z,t) + J p (z - l)F 3 (z,t) + k p (l - z) ^ ] 



In the steady state 



- at 



dz 

0, i = 1, 2, 3), addition of Eqs. © and (EJ) results 



(6) 

(7) 
(8) 



p r 3{Z) = Kp—Q^~ (9) 

With the help of the Eqs. (JSD, @, © and Q, Fx(z) and F 2 (z) can be expressed in terms of 
F(z). Then, in terms of the generating function F(z), the Eqs. (Q, ([7]) and ([8]) can be written as 

(z - lfF"\z) + { ai (z - 1) - h(z - l) 2 }F"(z) + {a 2 - b x b 2 {z - l)}F'(z) - b l( i 3 F(z) = (10) 

where a x = (1 + s 1 + s 2 + s a + s d ), b x = J p /k p , a 2 = sxs a + s 1 s d + s 2 s d , b 2 = a x - s d , a 3 = sts a , 
si = h/k p , s 2 = k 2 /k p , s a = k a /k p and s d = k d /k p . 

The solution of the Eq. (fTUj) is a generalized hypergeometric function and is given by 

F(z) = CpFglg! - g 2 ; g x + g 2 ; hi - h 2 ; h x + h 2 ; b x {z - 1)] (11) 



where g x = -\ + |, g 2 = \ y/(b 2 - l) 2 - 4a 3 , hi = -\ + , /i 2 = \^ {a x - I) 2 - Aa 2 , C is the 
normalization constant and p F q (a,b,c,d) is the generalized hypergeometric function (GHF). The 
normalization constant can be determined easily from the condition -F(l) = 1. 

Differentiating Eq. (TTTj) n times w.r.t. z at z = 0, one can easily obtain the expression for the 
steady-state probability density function p{n) as 

( \ n K^{9^n)Y{g 2 ^n)Y{h x )Y{h 2 ) 
V\n) = C p F q {g 1 + n; g 2 + n; h x + n; h 2 + n; -61) 12 

n\ T(hi + n) T(h 2 + n) T(g 1 ) T(g 2 ) 
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FIG. 4. Plot of p(n) versus n obtained from stochastic simulation using Gillespie algorithm with 
the rate constants k\ = 1, k 2 — 6, k a = 10, ka = 1, J p = 16 and k p = 1 (same as the long dashed 
curve of Fig. 3). For k p = 1, Sj = ki (i — 1,2, a, d) and b\ = J p . 



The plot of p(n) versus n for different values of (i = 1, 2, a, d) with 6i = 16 is shown in Fig. 
3. Different curves in Fig. 3 show that the distributions of protein levels are bimodal in different 
parameter regions with s» (i — 1, 2, a, d) ^ 1. The binary response can also be observed in a region 
of parameter values with Si (i — 1, 2, a, d) < 1 (not shown). The binary responses in the activator- 
repressor system cannot be observed for sj, ^ 2 (simultaneously). Figure 4 shows the binary 
response in protein levels obtained from stochastic simulation using Gillespie algorithm [23j for 
the biochemical reactions shown in Fig. 2 for the rate constants k\ — 1, k 2 = 6, k a = 10, ka = 1, 
J p = 16 and k p = 1 (For fc p = 1, s< = fc« (i = 1, 2, a, d) and b\ = J p ). 

To understand the origin of bimodal distribution in protein levels in the present scenario we 
calculate the components of probability density function pi(n)(i = 1,2,3) in the steady state. 
Using Eqs. ([9]) and (llip one can easily obtain ps(n) and is given by 

P3W = ^ n! + n + 1) ffS + n + 1) T(g 2 ) ^+n+l; g 2 +n + l; h 1+ n+l; h 2+ n+l; -b Xi 

(13) 

In the steady state, differentiating Eq. (JHJ) n times w. r. t. z at z — we have 

(srf + fti) , , b n l +l T{g l + n + 2)T{g 2 + n + 2)T{h l )T{h 2 ) 
p 2 {n) = p 3 (n) - 



s a ' ' k p s a n\T(h 1 + n + 2)T(h 2 + n + 2)T(g 1 )T(g 2 ) 



pFgigt + n + 2; g 2 + n + 2; h x + n + 2; h 2 + ti + 2; (14) 

From Eq. ([5]) we have 

Pi(n) = p{n) - p 2 (n) - p 3 (n) (15) 

where p 2 (n) and pz{n) are obtained from Eqs. ( fl~3l) and ( 1T41 respectively. 

Figure 5 shows the plot of total and component probability density functions p(n) and pi(n)(i = 
1, 2, 3) respectively versus n, the number of proteins, for the rate constants Si = 1, s 2 = 6, s a = 10, 
Sd = 1 and &i = 16 (same as the dotted curve in Fig. 3). The bimodal distribution in protein 
levels is clearly the resultant of three unimodal functions Pi(n) (i = 1,2,3) (Fig. 5). This is also 
true for other bimodal curves in Fig. 3. From the rate constants used to obtain the bimodal 
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FIG. 5. Plot of p(n) and Pi{n) (i = 1, 2, 3) versus n for the rate constants si = 1, s% — 6, s a — 10, 
Sd = 1 and &i = 16. The bimodal nature of the function p(n) is the resultant effect of three 
unimodal functions Pi(n)(i = 1, 2, 3). 

distributions in Fig. 3, it is clear that the probability of occurrence of the gene in the G\ and 
G3 states are higher. Once it is in the G\ state, the probability of transition to G 2 state is lower 
because of the lower value of s\. On the other hand, at any instant of time, if the gene is in the 
G2 state, there can be two possibilities: gene can switch either to the active state (G3) or to the 
repressed state (G\) depending on the amount of activators or repressors present in the system 
since activator molecules modulate the transition from G 2 to G 3 state and repressor molecules 
modulate the transition from G 2 to G\ state. Higher values of s a and s 2 make the G3 and G\ 
states more probable than G2 and due to the lower values of Si and Sd, the gene spends most of 
the time either in the G3 or G\ state. Once the gene is in the G3 state, the transition to G 2 state 
is rare because of the lower value of Sd- From the G3 state of the gene, proteins are synthesized 
with rate constant J p and there is enough time for the protein level to reach the steady value. 
This gives rise to high protein level in single cell and the peak in the distribution of protein level 
at higher value. This is clearly observed in the curve for p^{n) in Fig. 5. Now if the gene switches 
suddenly to G 2 state then protein level starts to decrease. The protein level keeps on decreasing 
as long as the gene is in the G 2 state or switches to the G\ state. If the gene switches to the G\ 
state the protein level decreases and reaches zero value. This gives rise to low/zero protein level 
in a single cell and the peak in the distribution of protein level occurs at low/zero value. This is 
observed in the curve for pi(n) in Fig. 5. From the G 2 state, the gene can also switch back to the 
G3 state and this causes a rise of protein level again from an intermediate value. Therefore, there 
is a finite probability to observe the protein level at the intermediate value. The curve for p 2 (n) 
shows a finite value at the intermediate region of protein level (Fig. 5). 

Rossi et al. examined whether an interplay of transcription factors can convert a graded 
to binary response in gene expression |12| . They designed an experiment in which the ratio of 
activator and repressor molecules that bind to the same promoter can be modulated by a single 
inducer molecule dox. Furthermore, the activator and repressor molecules bind the overlapping 
binding sites on the same promoter mutually exclusively. They analyze the graded and binary 
responses to the inducer molecule by flow cytometery in large population of individual cells. Three 
different cell populations viz. a dox regulated repressor ("repressor only"), a dox regulated activator 
("activator only") and both ("activator+repressor") were generated in the experiment to study the 
role of positive and negative transcription factors. The flow cytometric analysis of the activator 
only and repressor only cell populations revealed a graded response (unimodal distribution) of 
GFP expression at all dox concentrations. The binary response (two distinct sub-populations) 
was observed in cells containing both activator and repressor molecules for a range of intermediate 
dox concentrations. With increasing dox level, the increase in the number of cells with maximal 
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level of GFP and decrease in the number of cells with low GFP level is observed. Therefore, an 
all-or-none (binary) response to the inducer level is observed in the experiment of Rossi et al. 
[12] when a combination of activator and repressor molecules act on the same promoter mutually 
exclusively. Moreover, since either factor independently produces a graded response, the binary 
response observed in cells with both the regulatory molecules is not due to a dominant effect of 
one factor over the other but rather to their combined effect. 

Our theoretical analysis of the activator-repressor system does not explicitly include the acti- 
vator and repressor numbers in the equations but are included in the rate constants s a and S2- The 
rate constant s a increases with the increase in activator amount and S2 increases with the increase 
in repressor amount. Now, let us assume that the numbers of both molecules can be controlled by 
a single inducer molecule like dox, as in the experiment of Rossi et al. [12], so that s a increases 
and S2 decreases with the increase of dox. Depending on the presence of regulatory molecules, the 
gene regulatory network can be divided into three categories: activator-only system (i.e., only ac- 
tivator molecules regulate the network), activator-repressor system (i.e., activators and repressors 
both regulate the network) and repressor-only system (i.e., only repressor molecules regulate the 
network). In presence of only activator molecules the three-state gene activation process reduces 
to the two-state one. Random switching then takes place only between G2 and G% states. With 
the two-state gene activation process, the graded and binary responses are observed for s a , Sd > 1 
and s a , Sa < 1 respectively [19]. In one hand, with Sd > 1 if s a is varied from low to high value then 
unimodal responses are observed. On the other hand, with < 1, if s a is varied from low to high 
value then first unimodal (for s a < Sd), then bimodal (for s a ~ Sd) and then again unimodal (for 
s a > Sd) responses are observed [19J. Rossi et al. observed graded responses in activator-only sys- 
tem at all levels of inducer. To reproduce the experimental observations of Rossi et al. we choose 
the parameter region s a ,Sd > 1- Let us assume that initially there are only activator molecules 
(with low copy number) activating the gene transcription and Sd is fixed at 1.25. Now, with the 
gradual increase of inducer molecules dox in the system, s a increases and the mean protein level 
also increases gradually. The probability distributions always remain graded (curves in the left col- 
umn of Fig. 6) because the values of s a and Sd satisfy the condition of unimodal/graded response 
(s a ,Sd > 1) for all values of dox [19]. Let us now consider the same regulatory network (same 
Sd) but with repressor molecules also present in the system. The gene can now switch between 
all three possible states and both the molecules compete for their binding site to take control of 
the gene transcription. Let us assume that initially there are large number of repressors i.e., S2 is 
large and small number of activators i.e., s a is low. With the gradual increase of dox molecules 
in the system, S2 decreases and s a increases gradually and simultaneously i.e., inhibition effect 
decreases and activation effect increases simultaneously. This causes the conversion of unimodal 
(for low dox i.e., low s a and high S2) to bimodal (for intermediate dox i.e., intermediate s a and S2) 
and then again unimodal (high dox i.e., high s a and low S2) distribution of protein levels (right 
column of Fig. 6). The gradual increase in the inducer level causes a discontinuous change in the 
mean protein level. Therefore, the response is bimodal/binary as the mean protein level is not a 
continuous function of inducer but has only low and high values. These results (Fig. 6) are in 
qualitative agreement with the experimental observations of Rossi et al. [12] for activator-only and 
activator-repressor systems. 

Rossi et al. [12] also observed the graded response when only repressor molecules regulate 
the gene transcription. To reproduce the experimental observation for repressor-only case in the 
present scenario, one has to consider the basal rate of protein synthesis from the unregulated 
state of the gene (G2). With the basal rate of protein synthesis, say Jo (Jo < Jp)-, from the 
G2 state, the generation of graded response for repressor-only case is quite similar to that of the 
activator-only case discussed above. In the presence of only repressor molecules in the system, the 
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FIG. 6. Distribution of protein levels p(n) versus n in activator-only system (left column) and 
activator-repressor system (right column) for different level of inducer molecules i.e., for different 
values of s a and s 2 - For the activator-only system s 2 is kept fixed at 1.25 and s a is varied as 
mentioned on the top of the figures (figures in the left column). For the activator-repressor system 
Si and sa are kept fixed at 1.25 and 1.25 (s^ is same as in the activator-only system) respectively 
and the different curves are drawn for different values of s a and s 2 mentioned on the top of the 
figures (figures in the right column). For all curves the relative transcription rate constant b\ = 16. 
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three-state gene activation process reduces to the two-state one i.e., the gene can switch randomly 
only between G\ and G2 states. Repressor molecules help in transition from G2 to G\ state. With 
the finite basal rate of protein synthesis from the G2 state the response will be graded for Si > 1 
and for all values of S2 [E]. The initial value of the rate constant S2 is large due to the presence 
of large number of repressor molecules in the repressor-only system, the response in this case will 
be unimodal since Si = 1.25 and S2 is large. Now with the gradual increase of dox concentration, 
the rate constant S2 decreases from a high to a low value but the response still remains graded 
due to s% being greater than one (si = 1.25). Therefore, with the basal rate of protein synthesis 
from the unregulated state the graded response can also be observed for repressor-only system. 
But with the basal rate of protein synthesis from the state G2 the derivation of exact analytical 
expression for the probability density function of protein levels for the activator-repressor system 
is very difficult. Though, with the help of stochastic simulation using Gillespie algorithm, it can be 
shown that the finite basal rate of protein synthesis from the state G2 does not change our results 
qualitatively. The qualitative nature of the curves drawn in Figs. 3 and 6 will remain unchanged 
with the basal rate of gene expression from G2 taken into account. 

III. DISCUSSION 

In this paper, we have studied an gene regulatory network where the positive and negative 
transcription factors regulate the gene transcription mutually exclusively. Both the molecules 
compete for their respective binding sites on the DNA to take control of the network (Fig. 1). The 
activator-repressor system is represented by a simple stochastic model where gene can be in three 
possible states viz. inactive/repressed, unregulated and active. An exact analytical expression 
for the probability density function of the protein levels in the steady state is derived and is a 
generalized hypergeometric function (GHF) (Eq. (1T21)). From the GHF, the bimodal distribution 
in protein levels is observed in a wide region of the parameter values. From the theoretical analysis, 
the experimental observation of Rossi et al. (i.e., the regulation only by activator molecules 
produce the graded response in the protein levels whereas binary responses are observed when 
both the activator and repressor molecules regulate the gene transcription by binding the promoter 
mutually exclusively) can be reproduced very easily. Here we have considered only the parameter 
region Si(i = 1,2, a, d) > 1 (Fig. 3). The binary response in protein level is more prominent 
for Si(i = 1,2, a, d) < 1 (not shown). This region is excluded from the present analysis because 
with Si(i = 1,2, a, d) < 1, the binary response can also be observed for two-state activator-only 
system [19J. But the experiment of Rossi et al. [12] observed only graded response when only 
activator molecules regulate the gene transcription. This experimental observation along with the 
theoretical prediction of graded response in gene expression [19j helps us to choose the parameter 
region for theoretical analysis. Rossi et al. observed the graded response also for repressor-only 
system. Here we have not considered the repressor-only case because this requires a basal rate of 
protein synthesis from the unregulated (G2) state. The basal rate of protein synthesis from the 
unregulated state brings difficulties in the analytical tractability of the model. In the presence 
of the basal rate of protein synthesis from the unregulated state, it is very difficult to express 
the components of the generating functions Fi(z) (i = 1,2,3) in terms of the total generating 
function F(z) and therefore the Chemical Master Equations (CME) cannot be expressed by a 
single differential equation like Eq. ffTUl) . Again, the reduction of the CME into a single differential 
equation does not lead to the exact solution of CME because of the unavailability of the analytical 
solution of the higher order differential equation. This shows the limited scope and applicability of 
the generating function technique used here to solve the CME. The difficulty increases when the 
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regulatory networks consist of nonlinear feedback loops. 

In the present analysis of activator-repressor system, we have combined the transcription and 
translation into a single step process. In the process of transcription mRNAs are produced from the 
active gene and then mRNAs are translated into proteins. Therefore, the steady state probability 
distribution (Eq. [12"]) derived here gives the correct description for mRNAs. The distribution 
in protein levels does not follow the bimodal mRNA distribution when the protein lifetime is 
longer than that of mRNA [21]. Despite the above limitations of the stochastic model, it contains 
important features necessary for an explanation of the binary response in an activator-repressor 
system and is graded in activator only system as observed in experiment |12| . The exact analytical 
result with three gene states is important and useful specially in the eukaryotic system. The gene 
activation of the complex eukaryotic system consists of many unknown number of rate limiting 
steps (chromatin remodeling, assembly of preinitiation complex etc.). The simplification of the 
complex gene activation process by the two-state one is the first approximation of the complicated 
biological process. The 'three-state' assumption may be considered as the second approximation 
of the stochastic gene activation-deactivation process. 

The present analysis of the origin of binary responses in three-state model may be helpful 
to explain the bimodal distribution in transcriptional silencing [25j. In transcriptional silencing, 
Sir proteins (Sir 2-4) are the key structural components of silenced chromatin and under their 
regulation the silencer can be in two possible states: repressed and derepressed. The silencer 
helps to assemble the Sir protein complex. This process of assembling is not a single step process 
but rather consists of several reversible biochemical steps. The intermediate steps between the 
repressed and derepressed states of the chromatin make the effective rates of transitions very slow 
and these slow rate of transitions ultimately may lead to the bimodal distribution of protein levels 
from reporter gene. 
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